Analysis of instantaneous F0 contours from two speakers mixed signal using zero frequency filtering

نویسندگان

  • Bayya Yegnanarayana
  • S. R. Mahadeva Prasanna
چکیده

Instantaneous fundamental frequency (F0) in voiced speech can be obtained from the sequence of epochs corresponding to the instants of significant excitation. The epoch sequence can be derived using the recently proposed epoch extraction method based on zero frequency filtering. The epoch extraction method is robust against additive noise degradation. But in a multispeaker mixed signal, the degradation is caused due to overlapping impulse-like excitations of two or more speakers. The feasibility of extracting the instantaneous F0 contours from the two speaker mixed signal using zero frequency filtering is studied in this paper. The present study is based on deriving speakerspecific Hilbert Envelope (HE) signal which emphasizes peaks due to impulse-like excitation of one speaker and suppresses peaks due to other speaker. The epochs from this speaker-specific signal are obtained using the approach based on zero frequency filtering. The results of the proposed method is demonstrated for three different cases of mixed signals of two speakers data.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mandarin Tones Recognition by Segments of Fundamental Frequency Contours

Mandarin is one of the tonal languages. In Mandarin tones, there are four lexical tones (tone 1 to tone 4) with four different fundamental frequency (f0), such as flat and high, rising, falling and then rising, and falling, respectively. In order to process signal on lexical tone, at first we have to identify which tone is. We would like to find out an efficient approach to identify Mandarin to...

متن کامل

Statistical Analysis of Fundamental Frequency Based Features in Speech under Stress

A significant part of the non-linguistic information carried in speech refers to the speaker and his/her internal state. This study investigates sixteen features based on fundamental frequency of speech F0 in order to detect stress in speakers. The most effective features resulting from experiments are presented here. The total frequency ranges of F0 across specific short-time speech segments c...

متن کامل

Analysis and modeling of fundamental frequency contours of hindi utterances

This paper describes the results of a preliminary study on the applicability of the command-response model to F0 contours of spoken Hindi, an official language of India with almost 400 million native speakers in the world. Analysis of observed F0 contours of a number of utterances by two native speakers indicated that the model with provisions for positive and negative accent commands applies q...

متن کامل

The effect of word frequency and neighbourhood density on tone merge

This paper studies the effect of word frequency and neighbourhood density on lexical tone merge in Dalian Mandarin. Monosyllabic words with two lexical falling tones (i.e. Tone1 and Tone 4) are produced by 60 native speakers from two different generations (middle-aged vs. young). The stimuli consist of three conditions: high neighbourhood density with high word frequency (HDHF), high neighbourh...

متن کامل

Using instantaneous frequency and aperiodicity detection to estimate F0 for high-quality speech synthesis

This paper introduces a general and flexible framework for F0 and aperiodicity (additive non periodic component) analysis, specifically intended for high-quality speech synthesis and modification applications. The proposed framework consists of three subsystems: instantaneous frequency estimator and initial aperiodicity detector, F0 trajectory tracker, and F0 refinement and aperiodicity extract...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010